NTCIR-7 Patent Mining Experiments at RALI
Authors
Abstract
We participated in the patent mining task at the NTCIR-7 workshop. Our experiments focus on the English corpus. Based on the Indri search engine, we implemented a patent classification system that assigns IPC categories to a research paper according to the annotated patents in the database. As this is a cross-genre classification task, we tried several methods to bridge the gap between research papers and patents. Unfortunately, most of the methods do not produce consistent improvements.
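The retrieve-then-label idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' system: a small TF-IDF cosine ranker stands in for the Indri search engine, the score-weighted voting rule is an assumption, and the names `build_index`, `cosine`, and `classify` are hypothetical.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Lowercase whitespace tokenizer; keeps purely alphabetic tokens only.
    return [t for t in text.lower().split() if t.isalpha()]


def build_index(patents):
    # patents: list of (text, [ipc_codes]) pairs, i.e. the annotated collection.
    counts = [Counter(tokenize(text)) for text, _ in patents]
    df = Counter()
    for c in counts:
        df.update(c.keys())
    n = len(patents)
    # Smoothed IDF (an assumed weighting, not Indri's language-model scoring).
    idf = {t: math.log((n + 1) / (d + 1)) + 1.0 for t, d in df.items()}
    vectors = [{t: tf * idf[t] for t, tf in c.items()} for c in counts]
    return idf, vectors


def cosine(a, b):
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def classify(paper, patents, k=3):
    # Use the research paper as a query, retrieve the k most similar patents,
    # and let each retrieved patent vote for its IPC codes with its score.
    idf, vectors = build_index(patents)
    query = {t: tf * idf[t] for t, tf in Counter(tokenize(paper)).items() if t in idf}
    ranked = sorted(((cosine(query, v), i) for i, v in enumerate(vectors)), reverse=True)
    votes = defaultdict(float)
    for score, i in ranked[:k]:
        if score <= 0.0:
            continue  # patents with no term overlap do not vote
        for code in patents[i][1]:
            votes[code] += score
    # Codes ordered by accumulated score, ties broken alphabetically.
    return sorted(votes, key=lambda c: (-votes[c], c))


if __name__ == "__main__":
    # Toy collection for illustration only; texts and labels are made up.
    toy_patents = [
        ("engine fuel combustion piston", ["F02B"]),
        ("neural network image recognition", ["G06N"]),
        ("image sensor pixel circuit", ["H04N"]),
    ]
    print(classify("a neural network for image recognition", toy_patents))
```

Because the paper is used directly as the query, any vocabulary mismatch between research papers and patents lowers the retrieval scores, which is the cross-genre gap the abstract refers to.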
Similar resources
KNN and Re-ranking Models for English Patent Mining at NTCIR-7
This paper describes our English patent mining system for the NTCIR-7 patent mining task, which maps a research paper abstract into the IPC taxonomy. Our system is built on the k-nearest-neighbor framework, in which various similarity calculation and ranking methods are used. We employ two re-ranking techniques that use richer features to improve performance. Our systems performed wel...
Using the Multi-level Classification Method in the Patent Mining Task at NTCIR-7
A patent includes a great deal of practical technical information and plays an important role in promoting scientific development. Research on patent classification and retrieval has significant application value. A patent is a special technical text with a strict hierarchical classification system and a normalized structure, and there are a number of relations between patents and their consti...
RALI Experiments in IR4QA at NTCIR-7
In this report, we examine what information retrieval techniques can help identify documents that contain answers to different types of questions. In particular, we exploit different external resources according to the type of question. Wikipedia is exploited for identifying personal names and their translations, as well as biography-related keywords. Google search engine is us...
Multi-label Classification using Logistic Regression Models for NTCIR-7 Patent Mining Task
We design a multi-label classification system based on a machine learning approach for the NTCIR-7 Patent Mining Task. In our system, we employ one logistic regression model per International Patent Classification (IPC) code to determine which IPC codes are assigned to a research paper. The logistic regression models are trained using patent documents provided by the task organizers. To mitigate...
Journal title:
Volume, issue:
Pages: -
Publication date: 2008